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SUBSTITUTE SPECmCAHON 

METHOD AND APPARATUS FOR IMPROVED BOOKMARK AND 
mSTORIES ENTRY CREATION AND ACCESS 

BACKGROUND OF THE INVENTION 

1. Tcclmical Field: 



[0001] The present iixvesntioix relates generally to organizing and accessing entries in a web 
page list and in particular to organizing a web page list to easily locate and retrieve relevant 
pages based on content. Still more particularly, the present invention relates to employiog 
keywords and search terms in coxmectiojx with a web page list to faciUtate location and retrieval 
of relevant pages based on content. 

2. Description of the Related Art: 

[0002] Almost all hypertext systems, including web browsers, provide a facility listing 
particular sites on the Internet for quick, easy access. Such a fecility is normally referred to as a 
bookmaxk or "favorites" list, a history list, a hotUst, or "channels." These lists are typically 
displayed within a browser or other communications management utility (such as Windows™ 
Explorer) as a textual Ust of titles which may be selected ia "po^nit-and-click" fashion by the 
usen Often a user may add new entries to the list, alter the title or Unifoxni Resource Locator 
(URL) associated with a list entry, reorder entries, or delete one or more entries from the list. 
Some lists simply identify, on a rolling basis, a specified number of the last sites which a user 
visited using the browser, 
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[0003] Several organizational problems attend these lists as they are currently implemented 
and employed. First, by nature these lists offer minimal insight into the content represented by 
an entry. The problem i$ analogous to locating a desired book or volume within a library. To 
locate a particular volume in a library with only ten books is usually easy. Even if the cover or 
table of contents of each of the ten volumes must be inspected to detennine the content^ this 
takes only a few minutes, at most. Locating the same volume within a library of 300,000 books, 
however, can be much more difBcult and time consuming. 

[0004] Furthermore, lists of this type continually expand as the user's web browsing 
continues, quickly growing past a generally manageable size. Users often accxmiulate hundreds 
of bookmarks in their bookmark lists> while history lists can expand to thousands of entries 
dependitig on browser settings. Altiiough the linear nature of such lists demands organization to 
efficiently locate relevant entries, txo automated mechanism for organizing these lists exists, 
Users typically do not spend the time required to organize their bookmark lists^ and history list 
organization is constrained by whichever sorting mechanisms are supported by the browser (e.g., 
by URL, name, first visited, last visited, etc). Even if organized, however, the organization may 
not provide sufficient information to enable selection of the appropriate site based upon content, 

[0005] The problem is further complicated when a group of users shares a common set of 
bookmarks, such as when one user sends a bookmark file to another or when multiple users 
employ the same machine with a common or standard bookmark and history configuration. In 
either case, the user wishing to locate particular content from the Hst(s) may find the proper 
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bookmark or history entry only by manually searching recursively through folders of shared 
bookmarks, and in some cases actually retrieving the associated web pages to ascertain the 
content. 

[0006] The problem, as illustrated by the library analogy above, revolves around scale, but is 
also compounded by the iact that categorized bookmark or history entries may fit into more than 
one designated category. While "folders" within such lists are designed to provide a tisefiil 
classification system for the entries, the categorization for a bookmark or history entry which 
best identifies the linked content may change over time and may not be intuitive across all users. 

[0007] It would be desirable, therefore, to provide a mechanism allowing the user to 
. efficiently and automatically locate hotUst entries related to a topic of interest, without requiring 
substantiaj organizational overhead. It would fiirther be advantageous for the mechanism to 
automatically index a site to facilitate location of desired content when each bookmark or history 
entry is created. 
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SUMMARY OF THE INVENTION 

[0008] It is therefore one object of the preseat inventioii to improve orgaiiizatioti of and 
access to entries in a web page list. 

[0009] It is another object of the present invention to organize a web page list for easy 
location and retrieval of relevant pages based on content. 

[0010] It is yet another object of the present invention to employ keywords and search terxxxs 
in connection with a web page list to fecilitate location and retrieval of relevant pages based on 
content. 

[0011] The foregoing objects are achieved as is now described. Each entry within a "hotlisf * 
(bookmark, favorites, or history Ust) includes a set of keywords topically identifying the content 
of the linked page. The keywords included within META tags for HTML files of an accessed 
Web page may be automatically detected and extracted by the browser for the hothst when the 
browser creates the entry for that page within the hotlist. Alternatively, the browser may tabidate 
terms Scom the page content itself to select keywords identifying the content* In either case, the 
user may selectively edit the keywords for a hotUst entry to add, modify, or delete keywords. 
The keywords are stored with the hotlist data and may be searched for a particular topic, Hothst 
entries matching the search criteria are displayed to the user and, upon selection of a particular 
matching entry by the user, employed to automatically generate a request for the coTresponding 
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page. 

[0012] The above as well as additional objectives, features, aad advantages of the present 
invention will become apparent in the following detailed written description. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0013] The novel features believed characteristic of the invention are set forth in the 
upended claims. The invention itself however, as well as a preferred mode of use, further 
objects and advantages thereof, will best be understood by reference to the following detailed 
description of an illustrative embodiment when read in conjunction with the accompanying 
drawings, wherein: 

[0014] Figure 1 depicts a data processing system network in which a preferred embodiment 
of the present invention is implemented; 

[00X5] Figures 2A-2B are data structures for bookmarks and history lists in accordance with 
a preferred emb odiment of the present invention; 

[0016] Figure 3 depicts a mock-xip of a user-interface for viewing and^or editing keywords 
associated with a hotlist entry in accordance with a preferred embodiment of the present 
invention; 

[0017] Figures 4A-4C are high-level flowcharts for processes of creating and maintaining 
hotlist entry keyword fields in accordance with a preferred embodime[nt of tiie present invention; 

[0018] Figure 5 depicts a mock-up of a user interface dialog enabling search of hothst 
entries utilizing content-identifying keywords in accordance with a preferred embodiment of the 
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present invention; and 

[0019] Figure 6 is a high-level flowchait for a process of searching, retrieving, and 
displaying hotlist entries based on keyword data within the hotlist entries in accordance with the 
preferred embodiment of the present invention. 
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DETAILED DESCRIPTION OF THE PBEFEBBED EMBODIMENT 

[0020] With reference now to the figures, and in particular with reference to Figure 1, a data 
processing system network in which a preferred e^nboduxtepot of the present invention is 
implemented is depicted. The data processing system network 102 includes a number of senrers 
104a-104n (where n is any positive integer) storing files including World Wide Web content. 
Servers 104a-104n are capable of selectively providing pages, typically HyperText Markup 
Language (HTML) documents and accompanying graphics, sound, or video files, in response to 
HypeiText Transfer Protocol (HTTP) requests from clients 106a-106n coupled to servers 104a- 
104n via Internet 108, 

[0021] As known in the art, content within servers 104a-104n is retrieved utilizing Unifonn 
Resource Locators (URLs) providing an address mapping to the content. Each LIRL typically 
includes a resoxirce identifier (e.g., "www"). ^ second level domain name (e.g., "ibm")* and a top 
level domain name (e.g., ".com"), and may optionally include a directory path and/or filename 
for the content being requested. 

[0022] As known ui the art and illustrated for chent 106ll, each individual chent 106a-106n 
typically includes an operating system 110 facilitatmg the interactiojx between the system 
hardware and various software programs and one or more user appKcations 112, which, may 
include all manner of network and non-network dependent applications ranging from 
spreadsheets to Intemet telephony. In the present invention, one of the programs executing 
within the chejxt 106n is a web browser 114 or similar data access and retrieval appUcation. 
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[0023] The client 106x1 and browser 114 in the exemplary embodiment are intended for 
shared use by multiple users. Among other constituent parts, the web browser 114 will typically 
have a file aad^or directory 116 for user data and preferences for a particular user, as well as a 
data structure 118 containing the user's bookmarks and a second data structure 120 containing 
the user*s browsing history. In the present invention, both the boobnarks list 118 and the history 
list 120 contain keyword data 122 and 124, respectively, associated with URLs within the lists as 
described in further detail below. The term "hotlisf' is employed herein to refer to any 
bookmarks, histories, or favorites lists, channels, or any other data structure containing a list of 
UKLs pointing to selected content 

[0024] Referring now to Blgnres 2A and 2B, data structures for bookmarks and history lists 
in accordance with a preferred embodiment of the present invention are illustrated* Figure 2A 
illustrates a bookmarks list 118, a listing of URLs which a user has selected for ready reference. 
Each entry within bookmarks list 118 includes a name 202 identifying generally the site which is 
linked (e.g., "IBM Home Page'O, an URL 204 providing the link to the referenced content, an 
add date 206 identifying the date on which the corresponding bookmark was added to bookmarks 
list 118, aad a last visited date 208 identifying the date on which the referenced content was last 
accessed by the user. Each entry also include a description 210 and keywords 122. 

[0025] Description 210 is typically an intelligible statement regarding the content referenced 
or the nature of the enterprise which published the reference content. Description 210 is usually 
captured firom the referenced content when a bookmark to the referenced content is created. For 
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example, the IBM Home Page includes a "description" meta tag contaixdng act intelligible 
statement regarding the tiatiLre of the page and the enteiprise publishing the page: 

<meta natne?="DESCRIPTION" content='Tlie IBM corporate home page, entry point to 
information about IBM products and services"/> 

This description may be captured when the liuk is added to bookmajdks list 118, and stored 
within description field 210 of the corresponding entcy, Altematively, the content of description 
field 210 may be manually entered by the user. Description 210 may be employed, for example, 
to provide a "flyover" pop-xjp description for the bookmark entry displayed whwiever a pointer- 
driven cursor overlies a display region in which the bookmark entry is displayed, 

[0026] Keywords 122 is a set of tejcxjos provided to identify the content of the corresponding 
bookmarks entry, generally in an arbitrary and unintelligible list of terms* As with description 
210, keywords 122 may be captured from the meta tags of content referenced by a bookmark 
entry at the time the bookmark entry is created: 

<meta name='*KEYWORDS" value="IBM Microelectronics analog mixed signal asics 
ceramic probe cards communications networking ics digital video mpeg electronic card assembly 
test foimdry services palette dacs ieee 1394 phy transceivers infrared technology memory mwave 
dsp packaging interconnect embedded controllers powerpc serial storage architecture x86 
microprocessors" content==''> 

Such keywords are employed by Internet search engines to index sites, but are not 
captured within a field for a hotlist entry as proposed by the present invention. 
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[0027] Alternatively, keywords 122 may be subject to manual entry or editing by the user. 
Unscmpulous enterprises ofleji misrepresent content in order to attract more "hits." For 
example, a site may include a reference to **sex" or the name of a competixig enterprise within the 
keyword meta tag of the content published, in an effort to draw hits from keyword searches 
unrelated to or not intended to identify the corresponding site. For this reason, user editiag 
capability over keywords 122 is preferred. 

[0028] Figure 2B illustrates a history list 120, a rolling list of the last n sites (where n is any 
positive integer) visited by the user or of sites visited by the user within the defined period (e.g., 
the last 30 days), which is automatically compiled by the browser. History list 120 also includes 
a iiame 212 and a corresponding URL 214 for each entry. In addition, each entry may inoliMle a 
first visited date 216 identifying the first time the user visited the corresponding site, a last 
visited date 218 identifying the most recent visitation by the user, and an expiration date 220 
identifying a date on which the corresponding entry will be deleted, where aging is employed as 
part of the browsing history algorithm. Typically, history list 120 may be sorted for display 
utilizing any of the previously described fields within each entry, 

[0029] History list 120 in the present invention includes keywords 124, an arbitrary (with 
respect to the rules of grammar) listing of terms defining the content referenced by the 
corresponding entry. As with keywords 122, keywords 124 may be automatically captured from 
the meta tags of the link content when the corresponding entry within history list 120 is created. 
Keywords 124 (and keywords 122) may be automatically updated whenever the linked site is 
revisited to reJQect changes in the content published at the referenced URL. 
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[0030] In addition to capture jfrom txieta tags or manual user entry/editing, keywords 122 or 
124 within a hotlist may be generated from the referenced content itself. The HTML text for the 
content may be processed to generate a list of terms which appears within the content, which 
may be filtered based on frequency of occurrence within the content, uniqueness, or any other 
criteria. The resulting terms may be employed in lieu of, or in addition to, any terms captured 
from the met a tags of the content or entered by the user within keywords 122 or 124. 

[0031 J Although only a bookmarks list and a history list are depicted in Figures 2 A and 2B, 
keywords may be employed with any type of hotlist, and the techniques for selecting keywords 
may be employed regardless of the type of hotlist in which an entry is created. 

[00321 With reference now to Figure 3, a mock-up of a user-interface for viewing and/or 
editing kejwords associated with a hothst entry in accordance with a preferred embodiment of 
the present invention is depicted. The user interface dialog 300 is generated in response to a user 
selecting a hotlist entry and actuating an edit or "view properties" function* The hotlist entry 
being viewed and/or edited in the example shown is a bookmarks list entry. Accordingly, the 
name 202, XJRL 204, description 210 and keywords 122 are all displayed witbitt the user dialog 
300, All of the display fields for name 202, URL 204, description 210 and keywords 122 
support editing capability, allowdng the user to edit the contents of those display fields. 

[0033] Referring now to Figures 4A through 4C, high level flow charts for processes of 
creating and maintaining hotlist entry keyword fields in accordance with a preferred embodiment 
of the present invention are illustrated. Figure 4A illustrates the process of generating a 
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keyword field for a newly created hotlist entry. The process begins at step 402, which depicts 
creation of a hotHst entry for content referenced by aa URL. The process first passes to step 404, 
which illustrates, after retrieving the linked content, if necessary, a determination of whether the 
linked content contains a keyword meta tag. 

[0034] While existing keyword meta tags may be employed, a novel HTML meta tag such 

as: 

<meta name='TIOTLISTKEYWORDS" value=", ..conten1=*'> 
may alternatively be employed. Such a meta tag would contain keywords specifically intended 
by the content pubUsher to be automatically extracted by a browser or similar HTML facility and 
included within a hotlist data structure to facilitate location of the desired hnk by the user at a 
later time. 

[0035] If the linked content contains keyword meta tags, the process proceeds to step 406, 
which depicts extracting the keyword terais fix)m the content meta tags. From step 406, or fix)m 
step 404 if the linked content does not contain keyword meta tags, the process may optionally 
proceed to step 408, which illustrates scanning the hnked content for unique or fi-equently 
recurring terms. A dictionary of "common" terms may be employed for this purpose, along with 
recurrence threshold(s). The content scanned may include meta data or only displayed 
contented, but should preferably include aU content data so that images, sound files and motion 
picture files may also be indexed within the keywords by Iheir name or description. 
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[0036] If step 408 is implemented, the process may then optionally pass to step 410^ %vhich 
depicts filtering and/or augmenting the meta tag keywords from the keyword meta tags (if any) 
utUizittg the content keywords identified by scanning the content. In this manner, intentionally 
misleading or misdescriptive keywords included within the keyword meta tags by the publisher 
may be eliminated if the tenns do not also appear within content itself. Alternatively, keywords 
for linked content may be sytxthesized from the content where no keyword meta tags are included 
within the published content, and meta tag keyword terms may be supplemented by content- 
derived terms- 

[0037] The process then passes to step 412, which illustrates entering the selected keyword 
terms (meta tag keyword terms, content-derived keyword terms, or come combination thereof), if 
any, into the keyword field of the botlist entry just created. The process then passes to step 414, 
which illustrates the process becoming idle until another hotlist entry is created. 

[0038] Figure 4B illustrates a process of automatically updating the keyword field of a 
hotlist entry. The browser or other HTML facility for which the hotlist is employed may allow 
the user to selectively designate the hotlist, or particular entries within the hotlist, for automatic 
updating whenever the content linked by a corresponding entry within the hotlist is revisited. 
The process begins at step 416, which illustrates the user revisiting content linked by an entry 
within the hotUst, the user may access the content through the hotlist entry, in which case 
triggering of the update process (e.g., as part of retrieval of the content) is straightforward. 
Alternatively, the user may access the content by manual entry of the URL or from a link within 
other content. In this latter instance, the XIRL of the content accessed must be compared to each 
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of the URI^ within the hotlist entries in order to determine whether the update process should be 
triggered- 

[00391 The process first passes to stqp 418, which illustrates a determination of whether the 
linked content has been modified since the last access of the content by the user. This may be 
determined firojn comparison of a "last visited" field within the hotlist entry with an update meta 
tag within the linked content, such as: 

<meta name=^XASTUPDATE" value=^Tue Apr25 18:43:34 2000"I> 
The need for a keyword update for the linked content may be presumed if the content has 
been updated siuce it was last accessed by the user* Alternatively, each entry within the hotlist 
may include a field in which the last update for the linked content, as last accessed by the user, is 
maintained. 

[0040] If the content has been updated since the user's last visit, or the need for a keyword 
update for the hothst entry is otherwise determined to exist, the process proceeds to step 420, 
which depicts updating the kejrwords for the hotlist entry. Any or all of steps 404 through 412 
firom Fignre 4A may be employed for this purpose. It should be noted that the keyword field 
within a hotlist entry may optionally be segregated into "fixed" keywords for the content, 
selected by ttie user and not updated with each visit, and "dynamic" keywords which are updated 
(and deleted or overwritten, if necessary) whenever the linked content is found to have changed. 
From step 420, or from step 418 if the lioked content is unchanged since last accessed by the 
user, the process proceeds to step 422, which illustrates the process becoming idle until another 
site linked by a hothst entry is accessed by the user. 
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[00411 Figure 4C illustrates the process of updating keyword- data based on manual editing 
by the user. The process begins at step 424, which depicts the properties of a hotlist entry being 
edited by the user* The process first passes to step 426, which illustrates displaying the current 
keyword terms from the selected hotUst entry to the user, and tiiea to step 428, which depicts 
receiviiig user specified keyword terms (e.g., an edited list ficoxa the display) for the entry. The 
process then passes to step 430, which illustrates entering the received keyword terms within the 
keyword field of the hotUst entry being edited, and then to step 432, which illustrates the process 
becoming idle until another hothst entry is edited by the user. 




/ [0042] With reference now to Figure 5^ a mock-up of a user i»^^ee dialog enabling search 
hotlist entries utilizing content-identifying key\$0f^ iaf'accordance with a preferred 
embodiment of the present invention is depicted: Diaj0^5O[[O]]2 allows the user to search the 
keyword substructures of a hotlist for $«Iected^e^ch terms. Dialog 50[[0]]2 allows the user to 
enter one or more search tecm(s^ ;^thiti ^js^^rimary keyword query term blonlca input-boxes 504, 
and to select logical oper^rs usmg a logical operator blank 506 for logical linking of terms 
submitted in the keyr^rd qu^i^ Any number of keyword term blanlcs input-box 504 and logical 
operator blnnko ifeput^broes 5 06 may be provided to the user, tod the nximber of provided 
keyword tejm bUmks input-boxes 504 and logical operator blanks input-boxes 506 may be 
expande^^^n those provided are all utiUzed. 

[0043J Upon actuation of the "Search" user control, a search engine (not shown) then 
searches the keyword data stored in the keyword section of the hotlist data structure. The user 
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will typically be able to include, in additton to the search term or search terms, a series of lo^cal 
operators that specify how the terms can be combined, for instance, the iiser will typically be 
able to combine search terms with a 'logical and* operator when more than one term needs to be 
present to indicate a vahd search resvilt. The user will typically also be able to combine search 
terms with a logical or' operator when one tern out of several possible search terms needs to be 
present to indicate a vahd search result. The user will typically also be able to combine search 
terms with a 'logical except' operator when the presence of some search terms must accompany 
the absence of other terms in order to indicate a valid search result. Other logical search terms 
may be available and will Ml within the scope of the present invention. 

(0044) Referring to Figure 6, a high-level jaowchart for a process of searching, retrieving, 
and displaying hotlist entries based on keyword data within the hotlist entries in accordance with 
the preferred embodiment of the present invention is illustrated. The process begins, at step 602, 
which depicts receipt of a hotlist entry search request. The process first passes to step 604, 
which illustrates displaying the search dialog to the user and receiving the search term(s) and 
logical operators (if any) £ix>m the user through the dialog. 

[0045] The process passes next to step 6065 which depicts identifying matching hotlist 
entries by comparixig the logical search criteria to keyword terms associated with each hotlist 
entry and determining which hotlist entries, if any, satisfy the search criteria. The hothst entries 
which are identified as satisfying the search criteria are then sorted and displayed. The process 
next passes to step 608, which illustrates a determination of whether a displayed hotlist entry 
(which satisfies the specified search criteria) has been selected by the user. If so, the process 
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proceeds to step 610, which depicts closing (or roiniinizing) the display and accessing th 
requested content utilizing the URL within the hotlist entry selected by the user. 

[0046] If the user does not select a displayed hotlist entry, the process proceeds instead to 
step 612, which illustrates a detennination of whether a request to close the display has been 
received (for example, by the user actuating a display control). If not, the process returns to step 
608 to continue polling for selection of a displayed hotlist by the user. If so, however, the 
process proceeds instead to step 614, which depicts closing the display of hotlist entries which 
satisfy the search criteria, and then to step 616, which illustrates the process becoming idle imtil 
another hotlist entry keyword search is received. 

[0047] The present invention provides an improved system of storage and retrieval for 
boolanarlcs in computerized information systems and grants the ability to locate relevant 
bookmarks based on the content or implications of the file represented by the bookmark. Still 
more particnlarly, the present invention displays of bookmarks related to a particular search term 
by use of a keyword data stjiicture and a search and retrieval engine. It also provides a 
particularly distinctive facility for automatically extracting keyword data from the content of the 
web page and placing it in the bookmark file for later search.' 

[0048] It is important to note that while the present ijxventton has been described in the 
context of a fiilly functional data processing system and/or network, those skilled in the art will 
appreciate that the mechanism of the present invention is capable of being distributed in the form 
of a computer usable medium of instructions in a variety of forms, and that the present iavention 
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applies equally regardless of the particular type of sigaal bearing medium used to actually carry 
out the distributiojL Examples of computer usable mediums include: nonvolatile, hard-coded 
tj^e mediums such as read only memories (ROMs) or erasable^ electrically programmable read 
only memories (EEPROMs), recordable type mediums such as floppy disks, hard disk drives and 
CD-ROMs, and transmission type mediums such as digital and analog coiumunication links. 

[0049] While the invention has been particularly shown and described with reference to a 
preferred embodiment, it will be understood by those skilled in the art that various changes in 
form and detail may be made therein without departing from the spirit and scope of the 
invention. 



AUS920000120US1 



-\9- 



PAGE 33/43 * RCVD AT 4/15/2004 3:01:53 PM [Eastern Daylight Time] * SVR:USPTO-EFXflF-1/0 ' DNIS:8729306* CSID:5123436446 * DURATION (mnKS):11-24 



